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INTRODUCTION 


The writer has presented previously (104, pp. 5-15) a fairly 
comprehensive review of the experimental literature in this field up 
to 1930. The present summary brings this up to 1938, and, as in the 
previous summary, the findings are mostly simply set down under 
" appropriate headings and allowed to speak for themselves. 

Indicative of increased interest in these matters is the fact that 
the former review, covering a much longer period of time, included 
only 147 references, .whereas the present one includes 328. The 
increase in number is striking, even though the figures as given are 
a bit exaggerated, due to the inclusion in the present summary of a 
number of studies published previous to 1930 but not included in the 
original summary, either through oversight or because the present 
summary includes certain aspects not originally considered. 


INTELLIGENCE vs. High ScHooL -GRADES vs. EXAMINATIONS, ETC. 


Although more and“more attention has been paid to factors other 
than intelligence, the importance of this factor is ever with us. 
Various investigators (15, 41, 209, 219, 234, 235, 307) report corre- 
lations with grades ranging from: .33. to .64. At a number of places 
intelligence test scores were found to be the best single predictive 
device for grades (98, 209, 249, 263, 299). 

But there are two other claimants to first honors, namely: high 
school grades, and content examinations of one sort or another. 

High school grades were found to show a higher correlation with 
college grades than did intelligence, or anything else, by a sizable 
number of investigators (64, 79, 85, 121, 124, 234, 235, 268, 310). 
The correlations reported are mostly in the .60’s and .70’s, going 
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as high as .78 (85). Lincoln (148) and Crawford and Burnham (37) 
found high school grades better for prediction than entrance exami- 
nations. Hartson (107) found high school grades and intelligence 
to be of equal value. One study (122) found high school grades 
better than intelligence for predicting failures. In another (85) 
single high school subjects were found to have rather high predictive 
value for average college grades, r’s ranging from .56 to .70. 

There have been a number of investigations based upon rank in 
high school graduating class rather than high school grades. Drake 
and Henmon (56) found high school rank the best single predictive 
measure, and Root (227) found it the best precollege entrance pre- 
dictive measure. In two places (11, 23) high school rank was found 
to be better than, or about equal to, high school grades. In 
another (78) high school rank correlated .69 with grades, which was 
a figure almost twice as large as-the best correlation with the 
Minnesota College Aptitude Test. 

Entrance examinations (1, 35), the Iowa High School Survey 
tests (88), and a composite placement test score (110) have each 
predicted grades better than intelligence tests. In the last case the 
r was .61. At Ohio State (202) the freshman placement tests were 
the best single predictive measure, while in some New York 
studies (62, 178, 283) the State Regents’ examinations average was 
the best single measure. In two studies (7, 169) content examina- 
tions and intelligence tests were found to be about equally valuable. 

English and reading tests alone have, in a number of places, been 
found to have high correlations (r’s = .36 to .60) with total college 
grades (89, 178, 216). At Temple (96) intelligence, high school 
grades, and the cooperative English test were all found to have the 
same predictive value for grades (r = .60). 

In addition to these precollege-entrance criteria, first-semester or 
first-quarter college grades have been found to be the best single 
criterion for prediction of later college grades (15, 200, 226). In 
one case (200) the ry was 81. At Hamline (227) the freshman 
English grade alone turned out to be the best single criterion for 
grades in general (r=.71). In one apparently unique study (112) 
the average college rank of former students coming with a similar 
record from the same high school was found to give better predictive 
results than simple high school rank. 

So far we have discussed single criteria. A good deal of attention 
has been paid to combinations of these same criteria in predicting 
grades, and almost every possible combination has been looked into. 
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High school grades combined with intelligence test score uni- 
formly show higher correlations with grades than does either com- 
ponent separately (79, 122, 124, 209). Some other findings are 
indicated in Table I. 


TABLE I 


CoMBINATIONS OF FAcrors 
Reference Relationship 
No. N Factors to Grades 
108 Over 500 High school grades, intelligence, 


and ratings by principal and 


teacher R=.71 
263 122 High school grades, intelligence, 
and personality ratings 71 
254 (Not given) High school grades, intelligence, .87 
and freshman grades (four years) 
15 250 Second-year high school grades 
and intelligence tested in ele- .67 
mentary school (first semester ) 
52 1200 High school grades and scholastic 
aptitude test .63 
99 756 High school grades and scholastic 
aptitude test 71 
169 583 High school grades and placement 
examination .66 
16 1825 High school grades, intelligence, .63 
and high school rank (freshman ) 
110 (Not given) Intelligence, placement test, and . 66 
Colgate B; scale (freshman ) 
242 149 Reading, English, immediate recall, .70 
and Beta-type tests (freshman) 


227. (Not given) Intelligence, high school rank, col- 
lege aptitude test, and freshman 


English .83 

37 3277 High school grades, scholastic 
aptitude,. mathematics aptitude, .69 
C.E.E.B. average, and age (freshman) 


It can be seen that results vary in different places, and definite 
conclusions are not easy to reach. Segel (240), in a summary of 
over 100 studies, concludes that, for general prediction, general 
achievement tests are best (median r—.545), general mental tests 
next best (median r= .44), and specific tests of various sorts next 
(median r= .37). 


PERSONAL DETAILS AND BACKGROUND 


Age. As in the previous summary (104, p. 12), findings on the 
age factor are overwhelmingly to the effect that the younger students 
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get better grades; but it is worth noting that in most cases either 
no account is taken of intelligence (77, 129, 188, 210, 262, 279) or 
else, where intelligence is mentioned, the younger students are found 
to have the advantage (61, 98, 171). In the one study where intelli- 
gence was controlled by matching (233), no difference in the grades 
of the low and high age groups was found. In one study (258) 
students who received mid-term warnings were slightly older than 
average, but averaged lower on the psychological examination. In 
another (287) it was found that, with groups equated for Regents’ 
average, both the under-age and over-age students did better than 
the middle group. In three studies (84, 161, 205) no relation was 
found between age and grades. 

Sex. The preponderance of evidence here is again (see 104, 
p. 12) that women students get better grades than men students (14, 
25, 110, 147, 152, 155, 287). In only two of the studies quoted, 
however, were groups equated for intelligence. One study (44) 
reports no sex differences in intelligence or grades. Another (18) 
found the women reliably higher on intelligence test scores, but only 
unreliably higher on grades. There should be mentioned in this 
connection a study (154) which suggests that male instructors are 
not as objective in grading women students as in grading men; 
although another (322) presents evidence that the sex of teachers 
or students plays little or no part in grades given. 

Family Data. As before (see 104, p. 14), there are conflicting 
findings on position in, and size of, family (196, 210, 287, 301). At 
Purdue (210) the superior students more frequently than average 
had fathers who were professional men and less frequently had rela- 
tives who were Purdue alumni, but a number of other studies (18, 
84, 110, 187, 196) agree in finding no relation between either paternal 
occupation or family college training and grades. Results regarding 
immigrant parentage are also conflicting (61, 110). In one 
study (42) a number of items on the Sims socioeconomic score 
card were found to correlate with grades to a much higher degree 
than they did with intelligence. In another (230) a grouping of 
students according to religious affiliation showed differences in 
average grades which were in some cases statistically: reliable; but 
data on intelligence is not given. 


PHYSICAL DaTA 


Behrens (5) found among 292 probation students a higher per- 
centage of physical or health handicaps than among a control group. 
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Other investigators (110, 301, 307), however, report negative or 
contrary findings. 

It was found in a study of 353 men that with intelligence equal 
for both groups—in fact, with the smokers having a slight advan- 
tage—there was an indisputably reliable inferiority in the scholarship 
of the smokers. Further, among the smokers, those who inhaled 
were reliably inferior in scholarship to the noninhalers (60). 

One study (191) found negligible intercorrelations among metabo- 
lic rate, vital capacity, blood pressure, and scholarship. Lauer and 
Evans (141), in a study of physiological changes resulting from 
experimental “ startle” situations (N —50), found that the return- 
to-normal time of the galvanometric reading correlated —.09 with 
intelligence and +-.35 with grades; the amount of heartbeat change 
with exercise correlated —.22 with intelligence but —.65 with grades. 
At Purdue (210) the “distinguished” students averaged slightly 
heavier than average. At Michigan (197) Pillsbury claims to have 
found correlations as high as +.29 between semester grades and the 
asthenic body type, and uniformly lower grades among pyknics. 


Intelligence is not mentioned. The number of cases varied from 
100 to 371. 

Lorenz and McClure (150) found (N —811) that the 9% who 
were color-blind (Ishihara) were slightly above average on intelli- 
gence and slightly below average on grades. At Bucknell (102), 
using an audiometer, no relationship between grades and auditory 
acuity was found. At Temple (150), using four groups of 25 
students rated for beauty or physical attractiveness by two groups 
of 24 students from another college, the correlations were as follows: 
for girl subjects, beauty ratings correlated +.22 with intelligence 
and +-.28 with grades; for boys, ratings correlated —.08 with intelli- 
gence and +.18 with grades. 

In general, the recent findings do not bear out the positive trend 
indicated by the results on this topic previously reviewed (see 104, 
pp. 12-13). 





PERSONALITY, INTERESTS, ATTITUDES 


Ratings. Ratings on personality “traits” have again been found 
to correlate with grades in a number of instances, but interest in 
this type of study seems to have fallen off (see 104, p. 7). Coeffi- 
cients as high as .57 were obtained from combined ratings by high 
school principals and teachers on intelligence, attitude, industry, 


methods of study, etc. (107, 108). The correlation of these ratings 
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with intelligence or high school grades is not given. At Oregon (263) 
ratings on the American Council on Education Personality Report 
correlated .40 with scholarship, and adding in the ratings raised the 
multiple R of intelligence and high school grades with scholarship 
from .67 to .71. It was also found that ratings by friends on emo- 
tional steadiness correlated .30 with grades, intelligence not being 
mentioned (80); that “distinguished” students were rated above 
average on the Purdue Personnel Rating Scale (210); and that 
judgments by administrative officers correlated higher with grades 
(r’s = .54 to .64) than did intelligence or high school rank (133), 
In one interesting investigation (323), each member of the freshman 
class, at the end of the year, rated as many of his classmates as he 
felt he knew well enough on his degree of like or dislike for them. 
No appreciable relation appeared between scholarship and any of the 
following: number of classmates rated; number by whom rated; 
average degree of liking expressed for others ; average rating received 
from others. : 

Personality Tests. Stalnaker (249) found the Laird C, (intro- 
version) scale to predict grades to the extent that the average grade 
difference between extreme groups gave a D/op figure of 28, 
Flemming (80) got a correlation of .26 with grades. In two other 
studies (95, 110) the same scale correlated zero, or practically so, 
with scholarship. The Pressey X-O test also showed no bearing 
on grades in two investigations (55, 95), although Flemming (80) 
claims a correlation of —.29 for his revision thereof, designed to 
measure emotional reliability or consistency. A sociability scale gave 
a low correlation (r =—.25) with grades for women students (44). 
Scores on none of the Bernreuter scales showed any but negligible 
bearing on grades (183, 274), although Stagner (248) found a 
D/op of 2.51 on average grades between extreme groups on the 
self-sufficiency scale. He also notes that high-self-sufficient, high- 
dominant, and low-neurotic scoring groups show a better correlation 
between intelligence and grades than contrasting groups. At 
Buffalo (44) the highest and- lowest quartiles (N’s=25 and 23) 
of a freshman group on the Thurstone Personality Schedule were 
compared as to intelligence and grades. The well-adjusted group, 
according to the scale, tended to do a bit better on intelligence and a 
trifle poorer on grades. At Ohio Wesleyan (266) scores on various 
personality inventories and on the Ohio Wesleyan Personal History 
Record showed no substantial relation to scholarship (r’s=+.12 
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to —.11). At Wisconsin (327) the same finding applied to the Bell 
Adjustment Inventory. 

Interests. Using an interest test patterned after the Pressey X-O 
test, Drake (55) secured low correlations (.19 to .24) with grades. 
Young and Estabrooks (324, 325, 326), using a special scoring of 
the Strong interest blank, have recently evolved a studiousness scale 
which they find to predict grades at Colgate fairly well (r=.35), 
independent of intelligence. They point out (324) that the items most 
prognosticatory of studiousness, taken together, paint what closely 
resembles the accepted picture of the “introvert.” Mosier (180) found 
this studiousness scale to predict grades quite well (r=.55) for a 
group of 42 arts students, but not so well (r’s=.29, .25, .19) 
for somewhat larger groups of technical and business students. 
Williamson (314) found low correlations of .23 and .10, using two 
different groups of arts students. 

Questionnaires. In a number of studies questionnaire material 
or items on tests which distinguished better from poorer students 
also point to the conclusion that what is commonly called “ intro- 
vert’ behavior is associated with better grades than its opposite (61, 
183, 304). 

Maladjustment. In one study (183) “ non-achievers,” t.e. those 
of high intelligence whose grades did not measure up to their indi- 
cated capacity, more often experienced conflicts about religion and 
life in general than did the “achievers”’ (of equal intelligence and 
better performance). The total number involved here was 118. And 
in another (304), among 1800 students, “discrepant failures ”— 
high on intelligence but low on examinations—were compared with 
“discrepant distinguished” students—low on intelligence but high 
on examinations. The former were found to be more frequently 
handicapped by mental disabilities of the anxiety type. On the other 
hand, it was found (233) that younger students, although reporting 
more frequent “subjective” difficulties at college, earned grades 
equal to those of an older group matched for intelligence ; and (204) 
that the clients of the student mental hygiene clinic showed no 
inferiority in academic performance. 

Liberalism. In three studies (44, 62, 175) there was some rela- 
tion between high grades and a tendency towards liberal or radical 
economic and social views, but in two cases there was also an intelli- 
gence advantage on the side of the liberals, while in the third 
intelligence was not noted. 
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Hicu ScuHoot Factors 


Public vs. Private Schools. In a number of studies findings have 
indicated that public high school graduates do better college work 
than private school graduates (4, 21, 131, 199, 217, 218). In two of 
these (4, 21) the groups were equal in ability, and in one (131) the 
private school group was appreciably superior on college aptitude. 
It is pointed out in the same study, however, that students from 
certain private schools do superior work. In two other studies (9, 
110) no superiority of public school entrants was shown, even though 
in one case (9) these students were, on the average, younger. 

Subjects Taken in High School. Findings here are even less in 
favor of the “ mind-training’’ subjects than in the previous sum- 
mary (104, p. 10). Only one carefully controlled study (246), 
involving a large number of cases (N=1025), showed definite 
superiority in freshman grades for those who had had Latin in high 
school, as against other foreign languages or no foreign language, 
In the case of Latin vs. German, the D/ep was 2.46. 

Two other investigations found: a correlation between grades 
and the number of foreign language and mathematics units offered 
for entrance, and an inverse relation between grades and number of 
social or natural science units (229) ; and a similar positive relation 
for Latin and a negative one for history (77); but in neither case is 
any account taken of intelligence. At Northwestern (24) a corre- 
lation of .20 between grades and the amount of Latin taken in 
high school becomes .04 when intelligence is partialled out. At 
Nebraska (319) and Buffalo (62) Latin or Latin and mathematics 
in high school went with higher grades, but intelligence was also in 
favor of these groups. 

The remaining investigations in this field (51, 52, 109, 124, 184, 
225) all add up to the conclusion that no subject or combination of 
subjects, in any amount, has any noticeable bearing on grades. 

Time Spent in and Since High School. A number of studies at 
the University of Buffalo (260, 261, 262), using groups carefully 
matched for intelligence and other factors, lead to the conclusions 
that (1) boys out of high school two years or more before entering 
college, and girls out one year or more, do superior college work; 
(2) girls who have done postgraduate work in high school before 
coming to college do better work than their controls, but boys do 
not: (3) graduates of high school in three years do no better than 
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their controls. The numbers involved in these studies were rather 
small, the largest in any one group being 65. 

In other studies time out since high school graduation was not 
found to affect grades (110), and entrants direct from high school 
were found slightly inferior on grades (but also on intelligence) to 
transfers from other institutions (147). 

Size and Location of High School. In two studies (164, 210) 
the superior students seemed to come more from large city high 
schools ; but three others (50, 124, 195) do not bear this out, although 
in one of these (124) it was noted that grade prediction was more 
accurate for those from larger schools. 

At Fisk (18) students from southern cities, while reliably inferior 
on intelligence test scores to those from northern cities, receive 
(unreliably) higher grades. In a study at Purdue (249) men from 
other states, although reliably superior to men from Indiana on 
intelligence test scores, and very definitely superior to them on high 
school grades, were not quite reliably superior to them on first- 
semester grades. At Buffalo (287) students from that city get better 
grades than students from elsewhere, with groups matched for 
Regents’ average. At East Central Oklahoma Teachers College (161) 
urban vs. rural background had no bearing on grades. 

Other Considerations. Wakeham (293) found that high schools 
which send a large proportion of their students to the university 
invariably show poorer average college grades for their students than 
those which send only a select few; while Jones (126) found an r of 
.28 between grades and the size of the answer to the question: “ What 
per cent of your (high school) class intends to attend college this 
year?” 

In two studies (92, 164) there was no relation between students’ 
average grades and their schools being accredited or not accredited 
by the regional Association of Colleges and Secondary Schools. 

At Buffalo (287) it has been noticed that students from certain 
Buffalo high schools do better than students from other Buffalo high 
schools matched for Regents’ average; at Pittsburgh (110) no such 
local differentiation was found. 

Sarbaugh (232) found that students who in high school repeated 
examinations, either to raise already passing grades or because of 
failure, were much superior in college work to a group matched for 
age, intelligence, and Regents’ average. 

At Oregon (50) the size of high school teachers’ salaries corre- 
lated .01 with college grades. 
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It is easily seen that in recent years a good deal more attention 
has been paid to high school factors than previously (see 104, pp. 14- 
15) ; and in the present writer’s opinion this is sound procedure, even 
if findings are not yet very clear-cut. 


Stupy Hasits, METHops, ETC. 


Study Habits. A number of investigators, using questionnaires 
and inventories, have found that good students can be distinguished 
from poor achievers on the basis of specific study methods used, 
although not all findings agree on which methods characterize which 
group (68, 120, 183, 228, 300, 304). It should be stated that in 
three of the studies mentioned (68, 120, 300) no account was taken 
of intelligence. Wrenn (320) has evolved a self-rating study-habits 
inventory which he found to distinguish sharply between students of 
high and low scholarship at Stanford, with intelligence equated. 
However, Reeder, at Ohio State, found the inventory of little 
value (207). 

Counseling and How-to-Study Courses. In quite a number of 
places courses or counseling in proper study methods have shown 
gratifying results as measured by improved scholastic work (34, 38, 
65, 90, 170, 228, 296, 316). In one study (297) the interesting 
finding was that counseling of freshmen by senior students produced 
better results than counseling by instructors. 

In two places (5, 63) the effects of how-to-study courses, while 
observable, were slight. In a third (162) conferences with instruc- 
tors and the dean by students doing poorly at mid-term seem quite 
beneficial to students in the upper two quintiles on intelligence, but 
not to those in the lower quintiles. In another (306) special training 
had a beneficial effect which, however, wore off after a year. Three 
studies (83, 128, 312) found counseling and special courses not to 
have any significant effects. 

Reading Rate and Ability. That a relation exists between reading 
rate and comprehension, and college grades, has been pointed out 
by several investigators (12, 69, 178, 300, 321). In one of these 
studies (12) the relationship disappeared when intelligence was 
partialled out—and at Iowa State College (140) there was found an 
r of .205 between intelligence and reading rate—but in at least two 
of the studies mentioned (69, 321) intelligence was definitely not a 
factor in the results obtained. 

The possibility of improving reading ability thus seems a rather 


important question. Experiments on this point are unanimous in 
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their findings that reading ability of college students can be improved 
by special training (43, 45, 140, 176, 177, 190, 223, 267). In one 
study (176) improvement up to 150% (score on reading test) 
resulted from the remedial training. 

Two investigators (72, 177) agree that poor readers’ eye-move- 
ments involve longer fixational pauses and more frequent regressive 
movements. In another study (189) left-handed, ambidextrous, and 
shifted (as to handedness ) students did, if anything, a bit better 
than average on the Minnesota Reading Examination. 

Time Spent in Study, etc. A moderate positive relation between 
grades and time spent in study is reported from a number of 
places (6, 29, 62, 161). The highest correlation figure mentioned 
was .32, and holding intelligence constant raised the r to .38 (6). 
In one study (91) the relationship held for women students (r = .28) 
but not for men. In three others (70, 118, 311) no relation was 
found between grades and amount of time spent in study. 

In one of the above (70) it was found, however, that those using 
the library got slightly better grades than those who did not use it 
at all; and at Ohio University (151) a library information quiz 
correlated .21 with point-hour ratio, intelligence constant. At the 
same institution, Stoke and Lehman (257), with one group of 72 
students in a course in educational psychology, found that the number 
of assigned books actually checked out of the library correlated .49 
with point-hour ratio for the semester. They had another group of 
324 students state how often they had drawn out library books for 
preparing class assignments, and then checked by the signatures on 
library cards. They found that the percentage of overstatement, 
which ranged from 6% to 105%, was inversely related to grades. 
The authors suggest that dull students’ overestimation or misrepre- 
sentation of their studiousness may be a common thing and may be 
the explanation for any findings of a negative relation between intelli- 
gence and industry among college students. 

One investigation (125) found a positive r of .25 between grades 
and regularity of attendance at classes, intelligence constant. 

All told, the studies on this topic add very little to the summary 
in the previous study (104, p. 6). The writer is inclined to agree 
with the conclusion of Uhrbrock (277) that “. . . the total pattern, 
or rhythm, of activities may be far more important than any single 
factor that may be isolated for study,” and to add that the same 
probably applies to other topics in this survey. 




















































DANIEL HARRIS 


TEACHING METHODS AND CONDITIONS 


In three studies at Ohio State (87, 172, 203) sectioning gave good 
results in zoology and German classes. In two of the studies men- 
tioned no account was taken of intelligence. At Purdue (211) 
sectioning in language classes produced no results large or consistent 
enough to justify the sectioning. 

There have been three studies on the effect of weekly quizzes on 
grades in psychology courses, with conflicting results (73, 134, 273). 

One group of sections in two courses in psychology met only 
two-thirds as often as the other, but was encouraged to do wider 
reading in the subject. Achievement tests showed no reliable differ- 
ences resulting (105). Using control groups and courses in the three 
departments given by four instructors, students in classes meeting 
five times a week made greater gains than those in classes meeting 
three times a week ; but the difference was reliable in only one of the 
four comparisons. The investigator points out that the subjects were 
mostly freshmen, that one-third of those in the three-times-a-week 
classes made greater gains than the average in the control classes, and 
that this one-third were predominantly superior students ; and raises 
the question as to what the results would have been had the subjects 
been upperclassmen (192). 

In 1928 Hudelson (115) had reported on a number of investi- 
gations on the effect of class size on grades. He concluded that, in 
general, no convincing relationship is shown, although in some 
experiments the results favored large classes. A carefully controlled 
experiment on mathematics classes (213) indicates, similarly, a tend- 
ency for large classes (43 to 54) to do slightly better than small 
classes (19 to 27), although student opinion in both small and large 
classes favored the smaller-sized class. 

Investigations on the relative efficacy of the lecture ws. other 
teaching methods (3, 92, 106, 149) show no reliable differences. 

Two studies (76, 231) on the relative merits of regular class 
meetings vs. independent study give contradictory results. 

In an experiment repeated for two semesters (243), seniors who 
had one or two weekly periods for small-group discussions and 
directed study, instead of having regular “ recitation” periods at all 
three class meetings, did better on objective tests throughout the 
semester (D/op’s = 1.49, 2.68) and considerably better on the final 
examination (D/op’s =5.07, 2.18). The groups here were equated 
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for age, intelligence, freshman grades, practice-teaching scores, and 
personality ratings. 

An experiment at Minnesota (244) found the project method in 
laboratory psychology decidedly preferable to the “ regular” method 
for superior students but not for others. Another Minnesota 
study (186) established the following descending order of merit of 
class methods in an inorganic chemistry course. although no differ- 
ences were entirely reliable: (1) three hours of laboratory, one hour 
of oral recitation; (2) five hours of laboratory; (3) three hours of 
laboratory, two hours of outside reading; (4) three hours of 
laboratory. 

In one study (318) some slight but temporary effect on the 
achievement of those in the lowest decile and a half of the freshman 
intelligence distribution seemed to result from the institution of super- 
vised study periods three afternoons a week. In another (220), with 
groups equated for intelligence and high school achievement, the 
supervised study hall produced reliably higher achievement than 
home study and unreliably higher achievement than dormitory-room 
study. Those with low mental ability benefited most from the super- 
vised study. The total number of cases here was 151. 

In an investigation (185) with classes in geography, nature study, 
and American government, and groups matched for placement test 
scores and instructors, classes with compulsory attendance showed 
some excess of achievement, for inferior students, over the optional- 
attendance class; but average and superior students did just as well 
under the optional plan. 

In a study carried out in two Teachers Colleges (39) the regular 
method of outside assignments produced results superior to those 
obtained by confining all study to the classroom period. 


INCENTIVES AND Direct MorIvATION 


One 10-semester study (144) showed that, after the fraternity 
initiation academic requirements have been met, average grades tend 
to drop. 


Superior students (upper quartile on grades) had overcome 
greater handicaps to obtain a university education than the inferior 
students (lowest quartile). However, they were also superior in 
intelligence (61). Students placed on probation for poor work tended 
to do better while on probation than previously (69). Of those sent 
mid-term warnings, some, of course, eventually passed, and others 
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finally failed; the former group had somewhat lower psychological 
test scores, had done slightly better on Regents’, and carried more 
school and outside work than the latter group. Of those warned, 
those interviewed by the personnel director showed a lower per- 
centage of eventual failures than those not interviewed, although 
averaging lower on mental test and carrying much more outside 
work (258). A number of positive stimulants to good scholarship 
were tried out on a freshman class, including an encouraging letter, 
an interview with the Junior Dean, reading lists, a letter of com- 
mendation to parents of all students doing outstanding work during 
the first quarter, an honor roll, and a scholarship dinner. The net 
result was an “ unusually high” grade-point average for the entire 
group. There was no control group (59). 

At Hunter College (8) recipients of State scholarships, over a 
long period of years, are reliably superior to nonrecipients on grades 
and other indices of academic achievement. Intelligence data is lack- 
ing. At the University of Melbourne (26) scholarship holders, 
picked on the basis of performance on the School Leaving Exami- 
nations for secondary schools, do definitely superior work. Again 
there is no data on intelligence. At Teachers College, Columbia (247), 
no significant differences in scholastic achievement were found between 
the following three groups: (1) 380 who received financial aid; 
(2) 541 who applied for aid but were rejected; (3) a control group 
of 1102. 

A class of 196 students in a psychology course (75) was divided 
into two groups equated for intelligence and for score on an initial 
psychology achievement test. The experimental group was given 
monthly grades in terms of letters and deciles. The controls were 
simply told that their work was satisfactory, unsatisfactory, or failing. 
Both “A” students and those of low intelligence, in the experimental 
group, did considerably better work throughout the semester. 


STUDENT Loap 


Those who increased their course load from average ‘io heavy at 
Ohio University (145) did somewhat better average work during the 
ensuing semester than those who decreased their course load from 
average to light. At Arkansas (114) students who either increased 
or decreased their load in a given semester t2nded to increase their 
grade-point average more than those who carried a constant course 
load. In this study those who decreased their load showed the greater 
gain. However, in the same group there were low positive correla- 
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tions (.11 to .32) between number of hours carried and grade-point 
average in each of the four quartiles of the intelligence distribution. 
At Chicago (251) it was found that the more comprehensive exami- 
nations a student takes during his first two years at the “ College” 
the better on the average are his grades; but it is suggested that it is 
the more able students who take more examinations. 

At Ohio State (183) the “ achievers ”—those whose point average 
percentile was higher than their intelligence percentile—carried fewer 
courses than the “ non-achievers.” At Buffalo (163) a study of adult 
students in the evening session found those carrying only one course 
to be below the entire group on a 10-minute opposites test, but above 
average on grades. 

Two studies at Kentucky (96, 127) find no relation between 
student load and grades. 


EXTRACURRICULAR FACTORS 


Fraternity Members vs. Dormitory Residents, etc. Results on the 
relationship of fraternity membership to grades are again (see 104, 
p. 10) so utterly at variance with each other in the various studies (14, 
19, 153, 156, 167, 208, 249) that one can only conclude that it depends 
entirely on the local situation and tradition. Students living in their 
own homes in a college town were found to be slightly superior in 
scholarship to students of approximately the same intelligence living 
in the dormitories (100). This study involved 1200 students and 
covered a period of three years. At St. Mary’s College (220) resi- 
dent students got better grades than those nonresident (N =—151). 

Activities and Athletics. A number of studies at various places 
agree that participation in extracurricular activities goes together 
with high grades. In two of these studies there was also an intelli- 
gence superiority for the participating group (166, 171) ; in two this 
was not the case (58, 183); and in the fifth no mention is made of 
intelligence (173). In one (58) athletes receive lower grades than 
nonathletes. 7 

In three studies there is agreement that athletes receive at least 
slightly lower grades than participants in other activities. One of 
these (30) involved 4500 seniors in a large number of Pennsylvania 
colleges, and groups were matched for intelligence; but none of the 
differences were statistically reliable. In another (1€7) the better 
students engaged in debate, publications, and departmental clubs, 
while musical and dramatic activities shared with athletics the onus 
of being associated with lower grades. In the third (155) it was 
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noted that men students got higher grades each year in the spring 
semester than in the fall semester for 10 years; this is attributed to 
the distracting effect of football during the fall. In two studies (61, 
210) the better students (who are also more intelligent) expressed 
themselves as being less interested in athletics than did ordinary or 
inferior students. One investigation (18) found no relation between 
grades and athletics. 

Outside Work. Here, also, findings are largely contradictory, 
In two studies (58, 277) outside work was found to be associated 
with low grades; but in a number of others (81, 173, 271, 286) the 
opposite finding is recorded. In only two of these studies (58, 286) 
were intelligence and other factors equated for the groups compared, 

At Chicago (252) those students registered with the college place- 
ment bureau for part-time employment did reliably superior work 
(intelligence not mentioned). At Minneapolis (278) the conclusion 
is that moderate employment does not lower scholarship, but exces- 
sive employment does—at least for freshmen and sophomores. For 
upperclassmen, full self-support does not seem to be detrimental to 
grades. From Buffalo (258) comes the somewhat contradictory 
finding that, while students sent mid-term warnings put in more than 
average hours of outside work, nevertheless those warned who finally 
passed carried more outside work than those warned who finally 
failed. At Wittenberg (167) it was found that for women outside 
work went with lower grades, while for men the reverse was true. 


PREDICTION FOR SPECIFIC SUBJECTS 


TABLE II 
EncLIsH, Puysics, CHEMISTRY, ZooLocy, Botany 

Reference Relationship 

No. N Predictive Factor to Grades 
English 

284 661 Regents’ average r=.60 

96 120 Codperative English test r=.63 

290 661 Regents’ average 

Codperative English test Equal in value. 
305 4000 C.E.E.B. English test r=.39 
136 300 Iowa English placement test r=.63 
(predicted 33% of grades 
precisely ) 

165 ° Four subtests of above All equally valuable. 

216 1750 Carnegie English test r=.48 

256 183 Iowa English placement tests r=.65 


1 For convenience, much of the following material is presented in tabular 
form. 
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Reference 
No. 


253 275 


Physics 
288 661 


305 4000 


Chemistry 


186 350 


293 2026 


Zoology 
20 62 


288 661 


Botany 


264 202 


TABLE II—Continued 


Predictive Factor 


English placement test 

Psychological examination 

Both of above, plus English 
theme test 


Language aptitude test 
English training test 
High school English grades 


Number of high school lan- 
guage courses 


Intelligence 


Intelligence 


Regents’ in chemistry, physics, 
or trigonometry 
C.E.E.B. physics test 


Having had high school chem- 
istry 


Having had high school chem- 
istry 

Having had high school chem- 
istry and physics 

Having had high school chem- 
istry 

Having had high school chem- 
istry and physics 

High school average grades 

Chemistry aptitude test 

Iowa chemistry placement tests 

Iowa chemistry placement tests 

Ohio psychological examination 

Combination of chemistry and 
intelligence tests 


Having had high school zoology 


Iowa content test—science 


Merely labeling ready-made 
specimen drawings, instead 
of making own drawings 


110 (Not given) Ruch-Crossman biology test 


* See note, Bibliography. 


Relationship 
to Grades 


Four times as valuable 
as English placement 
test. 

Most important 
Other factors — pre- 
vious preparation in 
English, knowledge of 
the fundamentals, time 
spent in studying Eng- 
lish—all had very little 
bearing on grades. 


factor. 


r’s=.55 to .58 
r=.4l 


Makes for initial superi- 
ority, which later dis- 
appears. 


Grades 12% higher. 
" 


Grades 24.5% higher. 
None. 


Grades 2.5% higher. 
High r (unspecified ) 
7=.55 

. 


r 
Yr 


R 


A temporary initial ad- 
vantage. 

r’s=.55 to .61 (best pre- 
dictive measure) 


Reliable superiority. 


r=.56 
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Physical Sciences. At Buffalo (284) Regents’ average corre- 
lated .46 with grades in the physical sciences. At Lafayette (225) 
students got reliably higher grades in college in a science (biology, 
chemistry, or physics) which they also took in high school. One 
study in Detroit (221) found that men students did markedly better 
in geology than girls; but at Wooster (281) women did slightly 
better in geology than men (intelligence not noted). 

In one study (295) the science-mathematics scale of the Minne- 


TABLE III 
PsycHoLocy, MATHEMATICS 
Reference Relationship 
No. N Predictive Factor to Grades 


Psychology 
257 324 Number of assigned books 


checked out of library r=.42 
135 75 Reason given for taking the Genuinely scholastic aims. 
course go with higher grades 
(differences not reli- 
able). 
265 51 Van Wegenen reading scale r=.72 
Iowa silent reading test r=.63 
ud 552 Having had a course in zoology Almost reliable superior- 
ity, intelligence con- 
stant. 
48 * Miner’s interest blank Differentiates reliably be- 
tween A and B vs. 
D and E students. 
Mathematics 
67 (Not given) Algebra test (half hour) Both equally valuable 
High school record (figures not given). 
194 400 Iowa mathematics training test r’s as high as .74 
Iowa chemistry aptitude test 66 
Purdue personality rating scale 52 
A.C.E. psychological examina- 
tion 51 
Purdue placement test in Eng- 
lish 32 
54 387 Percentile rank, A.C.E. r=.26 
High school mathematics grades r=.46 
Average high school grades r=.47 
249 1034 Mathematics aptitude test r=.53 
37 3277 Mathematics aptitude test r=.66 
C.E.E.B. mathematics test r=.25 
305 4000 C.E.E.B. mathematics test 7=.33 
256 183 Iowa mathematics placement 
tests r=.59 
284 661 Regents’ average r=.39 
85 272 High school mathematics aver- 
age r=.50 
66 831 Knowledge of mathematical 
terms r’'s=.59 and .64 
Intelligence test r's=.58 to .85 


* See note, Bibliography. 
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sota interest blank was found to show a significant, though low, r 
with achievement in related fields. One writer (46) concludes from 
studies by various investigators that mathematics does not require 
specialized ability. 

Languages. Williams (309) found that students got better lan- 
guage grades when they had the same language in high school; and, 
where there was no time gap between high school and college study 
of the same language, the grades were even better. Wagner and 
Strabel (289) found the Regents’ average to afford the best index 
to success in any given language. It was also found here that grades 
in languages were more easily predicted than grades in any other 
field. But Held (110), with a composite placement test score, secured 
a correlation of .45 with foreign-language grades, which is less than 
the r with any other subject. 

In a study embracing seven eastern colleges, Whitman (305) 
found that the C.E.E.B. examination in French correlated .47 with 
French grades. At Oregon (54) high school mathematics grades 
gave an r of .46 with French marks. At Pittsburgh (110) the French 
C.R.B. test correlated .34 with French grades. This figure was the 
lowest for any subject predicted by an appropriate placement test at 
this institution. A seven-year study (302) found that training in 
Latin was related to better work in French composition and certain 
other fields in which knowledge of Latin is used directly. At Minne- 
sota (282) it was concluded, after trying various measures, that the 
Iowa Foreign Language Aptitude Test was the best single predictive 
measure for college German grades (correlations, .38 to .49). At 
Case (256) the Iowa French Training Test correlated .79 with first- 
semester French grades. 

Other Subjects. Two studies (74, 110) agree on the value of 
the Seashore tests for musical talent for predicting grades in music 
courses. In one of them (110) 100% of those in the lowest quartile 
on both musical talent and psychological test did unsatisfactory work 
in music. 

Whitman (305) found that the C.E.E.B. examination in ancient 
history correlated .26 with grades in that subject. At Pitts- 
burgh (110) composite placement test score correlated .56 with 
freshman history grades—the highest figure for any subject at this 
institution. Garrett (85) found high school average in history cor- 
related .67 with college history. At Buffalo (284) Regents’ average 
correlated .56 with grades in the social sciences. At Oregon (53) 
grades in history and social sciences were harder to predict than in 
any other subject and were not affected at all by the number of high 
school units in these subjects. 
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At Oregon (54) high school mathematics correlated .21 with 
grades in philosophy. Brown (13) found a multiple R of .59 with 
grades in a home economics course, based on an initial content 
examination, intelligence, honor-point ratio, previous study of foods, 
and estimate on home experience in food preparation. 

In General. It can be seen that there is no unanimous agreement 
as to the relative ease of prediction for specific subjects or the relative 
merit of any kind of predictive factor (e.g. high school grades in a 
specific subject) as compared with any other. In this connection, 
Segel (240), in his summary of a large number of investigations, 
comes to the conclusion that, for predicting scholarship in specific 
subjects, specific aptitude or achievement tests are best. At 
Buffalo (285) the conflicting finding is reported that Regents’ 
average not only is the best predictive measure for grades in general 
but will predict specific field grades as well as, or better than, aptitude 
or achievement tests or anything else. With regard to previous high 
school work in a field, a study at Wisconsin (16) found that with 
one exception, English, there was no relation between the number of 
semester hours of high school work in any field and college grades in 
the same field. 

In three studies (22, 137, 214) a definite relation was found 
between expressed preference for a subject and good grades in that 
subject. Chauncey (22) found that biserial r’s between preference 
and achievement ranged between .50 and .76 for different subjects. 
At Minnesota (279) correlations as high as .41 (for English) were 
obtained between interest scores worked out for various subject fields, 
based on the Minnesota interest blank, and grades in those subjects. 

Differential Prediction. There have been a number of studies 
concerned with predicting not specific grades as such, but the differ- 
ence between grades, e.g. whether a given student is going to do better 
in mathematics than in languages, or vice versa. Various criteria 
have been investigated, including achievement tests (238) and various 
parts of the A.C.E. psychological examination (241) ; but Segel (239), 
who has done most of this work, claims to have secured the best 
results by using various vocational interest scores based on the Strong 
interest blank. He reports r’s as high as .61, using a formula 
described in another article (142). (Using multiple r’s based on 
more than one Strong factor, he reports r’s as high as .87 in predict- 
ing differential achievement in the various parts of the Iowa content 
examination. The number of cases in this study, however, was 


only 100.) 
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More or less related to this topic is the finding (259) that fresh- 
men who entered after postgraduate work in high school, while not 
differing from matched control groups on average grades, did better 
than the controls in social sciences and poorer in mathematics. 

A study (143) investigated the relative predictive value, for dif- 
ferent courses, of the five subtests of the Thorndike Intelligence 
Examination for High School Graduates. These are: (1) reading 
comprehension; (2) linguistic ability; (3) mathematical ability; 
(4) following printed directions ; and (5) general information. Sub- 
tests (2) and (3) were found to have great predictive superiority 
over the others, each in its appropriate field. 

At Yale (36) predicted and actual grades correlated very highly 
for “ verbal” subjects (English, history, languages ; R = .62) and for 
“ quantitative” subjects (mathematics, chemistry, physics, mechani- 
cal drawing, biology; R—.67), the two predictive formulae being 
based on weighted combinations of high school rank and appropriate 
entrance examination scores. 


CURRICULA AND OCCUPATIONAL CHOICE 


Remmers (210) found “ distinguished ” students to be more often 
than others in the schools of agriculture, chemical engineering, or 
science. Fritz (84) found no relation between curriculum and 
grades. In another study (111) those preparing to teach in the 
grammar grades ranked highest, and the kindergarten-primary majors 
ranked lowest. Here, however, there were parallel intelligence differ- 
ences, and none of the differences were reliable. Eckert’s results (62) 
showed the students in the upper-grade quartile to be more interested 
in teaching as a vocation than those in the lowest quartile, but also 
higher in intelligence test score. Eurich (71) found that arts students 
failed most frequently in physics and beginning French, while edu- 
cation students most commonly failed in psychology. 

An analysis of the A.C.E. psychological examination and _ its 
subtests (179) led to the conclusions that (1) the subtests measure 
abilities independent of the ability measured by the total test; (2) the 
completion and opposites tests measure abilities needed in liberal arts 
and social sciences; (3) the arithmetic test measures abilities needed 
in natural sciences and law; and (4) the opposites test measures 
ability important in scientific work. A somewhat similar study (291) 
failed to show that varying the weights assigned to specific subtests 
of the A.C.E. examination improved the prediction of grades in either 
the engineering, agricultural, or general divisions of the college. 
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In studies at Franklin and Marshall, Ohio State, and the Univer- 


sity of London (160, 183, 304) definiteness of vocational choice goes R 
with higher grades. On the other hand, studies at Chicago (328) and 
Minnesota (315) and a State-wide survey in Illinois colleges (187) y 


all fail to find any superiority of grades to go with definiteness of 
vocational choice. In the Chicago study, based on over 2000 cases, 
those who had chosen an occupation did show some advantage in 
grades, but this disappeared when intelligence was held constant. In 
the Minnesota study (N = 860) it was found that, among the women 
students, those with an occupational choice received lower grades, 
although this difference was not quite reliable. 

These inconclusive results on vocational choice are in accord with 


the previous findings (104, pp. 10-11). 


PREDICTION FOR PROFESSIONAL CURRICULA AND SCHOOLS 


TABLE IV 
TEACHING *, Law, MeEpictne, DENTISTRY 
Reference pia 
No. N Predictive Factor 
Teaching * 
82 70 Army Alpha 
82 Thurstone intelligence 
132 840 Combination of intelligence, 
reading, and vocabulary tests 
307 780 Intelligence 
Physique 
49 135 Cox-Orleans prognosis test 
119 94 Admission on basis of “special 
examination on intellectual, 
emotional, and social traits,” 
in lieu of ordinarily required 
Regents’ average 
198 761 Precollege teaching experi- 
ence 
Age — 
Intelligence 
Academic grades 
272 750 Previous teaching experience 
Law 


86 Entire Ferson-Stoddard law aptitude 
freshman examination 
class 
117 (Not given) Combination of undergradu- 
ate grades, intelligence, Yale 
legal aptitude test 





Relationship 
to Grades 

r=-—.03 ‘ 
oa (student teaching) 

= .49( women } (oroteeden 
R= .64(men) grades) 
oe a (professional grades) 
r= aD 


(student teaching ) 

Do a bit poorer in practice 
teaching— intelligence not 
considered. 


r=.26 (men) 

r=.05 (women) 

No effect. (student 
No effect. teaching ) 


r=.09 (men) | | 

r=.40 (women) | 

Do a bit better in student 
teaching; no difference in 
educational theory. 


r=.54 with first-year law 
grades. Intelligence not con- 
trolled. 

R=.64 with four-semester law 
grades. 


Reference 
No. 


Medicine 
181 


Dentistry 
224 


6000 


FACTORS AFFECTING 


COLLEGE 


GRA DES 


TABLE IV—Continued 


Predictive Factor 


Moss medical aptitude test 

Grades in premedical subjects 

Interview ratings by medical 
school admissions committees 

Semester hours of premedical 
credit 

Combination of first two, above 


Moss medical aptitude test 

Thorndike intelligence test 

Premedical grades 

Moss medical aptitude and pre- 
medical grades 

Combination of all three, above 


Downey will-temperament test 


Age 
Downey will-temperament test 
Age 
Miles two-story duplicate maze 


Five mechanical ability tests 
(Wiggly block, finger and 
tweezer dexterity, hand 
steadiness, wax carving) 

Otis S-A intelligence test 


Predental average grades 


Science predental grades 


Otis plus science 


grades 


predental 


Relationship 
to Grades 


% Failures 
Predicted 


69 
43 


64 
-.62 
67 


-—.761 
> —.764 


r’s=.27 to .38 
r=.48 


r’s=.09 to .17 l 
r——.20 { 


r’s=.09 to .33 


rs=.15 to —.17 


(freshman ) 


(senior ) 


4 
(four-year grades) 
r=.44 
(first-year grades) 
r=.53 
(four-year grades) 
r=.41 
(first-year grades) 
r=.59 
(four-year grades) 


R=.67 


(four-year grades) 


*Previous sections have taken up prediction of academic grades in Teachers 
This section is limited to practice teaching and other strictly professional 


Colleges. 
courses. 


Whipple (303) reports an unpublished experiment wherein the 
simple task of sharpening a lead pencil with a knife “in a neat and 
workmanlike manner ”—the result being graded by comparison with 
a series of sharpened pencils of varying degrees of neatness and 
workmanship—was the one test of many tried which completely dis- 





148 DANIEL HARRIS 


tinguished between two groups of dental students rated as “ first- 
class” and as “ distinctly poor” in operative skill. Numbers and 
details are not given. Some German experiments (158, 159, 236) 


report correlations of .73 to .83 between professors’ ratings of dental 
students and scores in several tests of manual steadiness and dexterity, 


Small groups ranging from 12 to 20 in number were used. 


Reference 
No. N 


101 8000 


113 ( Not given) 


157 1100 


TABLE V 


ENGINEERING 


Predictive Factor 


Iowa chemistry placement 
examination 

Iowa placement examinations, 
total average 

Iowa training tests average 

Iowa aptitude tests average 

Mathematics training test 

Pre-engineering physics 

All pre-engineering courses 

Pre-engineering chemistry 

Pre-engineering mathematics 

A.C.E. psychological exami- 
nation 

Iowa silent reading test 

Engineering interest, Strong 

Stenquist mechanical aptitude, 
No. 2 

Stenquist, No. 1 

Stenquist Assembly, I 

Thorndike word knowledge test 

Ascendance (A-S blank) 

Missouri (Mann) drawing ap- 
titude test 


Iowa placement tests and 
Strong interest test 


Otis advanced intelligence test 
Freshman mechanical drawing 
Freshman engineering grades 
Freshman.mathematics grades 
Combination of all four, above 


Relationship 
to Grades 


r's=.23 to .63 
(first-semester grades) 
R=.67 to .75 
(freshman grades ) 
R=.52 

R=.39 } (first-quarter grades) 
r—.54 

7.57 

R=.52 

r=.45 

r=.40 


r=.55 


No relation. 

No relation. 

No relation. 

r=—,40 

r's=.63 to 81 

(grades in engineering draw- 


ing) 
R’s=.50 to .77 
(in various specific subjects) 
r=.40 } 
r—.45 
r=.58 } (advanced engineering 
r=.63| grades) 
R=.69 } 


Various. At a training school for nurses (219) the following 
measures were correlated with practice and theory grades of 48 first- 
year students: (1) Stanford-Binet; (2) A.C.E. psychological exami- 
nation; (3) Moss social intelligence test; (4) Moss nursing aptitude 
test; and (5) Bernreuter blank. None of these had any appreciable 
r’s with practice grades. With theory grades, their r’s were as fol- 
lows: (1) .37; (2) .62; (3) .30; (4) .62; with intelligence constant, 
.26; (5) negligible. 
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At a school of social work (168) the following factors, based on 
a study of 530 entrants over a 12-year period, were found to distin- 
guish groups between whose average grades the differences were 
statistically reliable: (1) college graduation; (2) number of under- 
graduate courses in sociology and the social sciences. Data on 
intelligence is not presented. The following had no bearing on 
achievement: (1) undergraduate activities; (2) interval between 
college graduation and entrance to the school; (3) size of home 
community. 

One study (245) found that, except for law, college preprofes- 
sional language grades are not related to subsequent grades in the 
professional schools. 

MISCELLANEOUS 4 

Prediction of grades has a number of times been found easier or 
more accurate for some groups of students than for others. Three 
studies (78, 283, 287) found prediction better for women students 
than for men, while one (50) found no increase in accuracy of pre- 
diction to result from considering men and women separately. Two 
of the studies (283, 287) also indicate that prediction is better for 
homogeneous groups in general, e.g. as to age, sex, school. This 
contradicts a previous finding (104, p. 16). Another finding (287) 
was that prediction is better for those with high Regents’ average and 
for those with high motivatien. 

In one investigation (10) students in psychology classes estimated 
their own course grade before taking the final examination and again 
after the examination. The correlations with the actual grades were 
66 and .74. The same students had rated their instructor, on a six- 
point scale, on his ability to teach. There was no relation between 
estimated or actual grades and ratings of the instructor. In 
another (28), involving three instructors, it was found in the case of 
one instructor that knowledge of his students’ scores on a previous 
intelligence test produced a statistically reliable effect on the grades 
he gave them. Mention has already been made (see p. 144: Predic- 
tion for Specific Subjects: In General) of the relation between 
preference for a subject and grades in it. In one of the studies 
quoted (214) it was noted that attitudes toward subjects varied 
significantly with the instructor involved. 

Grades in correspondence courses correlated .58 with grades in 
residence work (138). 


It is pointed out (15) that academic failures are easier to predict 
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than successes, the explanation being that a lack is hard to remedy, 
while an ability is easy to neglect. An analysis of probationers (201) 
led to the conclusion that in no case was it a single cause, but rather 
a constellation of troubles (study habits, social difficulties, emotional, 
vocational adjustment) which, in combination, caused the unsatis- 
factory work. 

Mentioned previously (see Extracurricular Factors: Activities 
and Athletics, p. 139) was the improvement in spring grades for men 
students (155), attributed to the absence of fall football distractions, 
In a study (14) based on 7000 students of both sexes over a period 
of eight semesters, there was found a marked tendency for the grades 
of all groups to be higher in the second semester of the year than in 
the first. In another (222) there was a similar finding. Here it was 
noted that third-semester grades are the low point of the entire college 
career of both engineering and arts students (N = 200) ; and this is 
attributed to the distractions of the social development and maturation 
which allegedly take place at this period of college students’ lives, 
One investigation (226) notes that the r between January grades and 
the following June grades is a bit higher (.79) than that between 
June grades and the following January grades (.72). 

A study at Minnesota (313) found that from 1929 to 1935 there 
was a rather steady decline in the correlation between scores on the 
college aptitude test and scholarship, for men in the College of Science, 
Literature, and the Arts, from an r of .40 to one of .19. A number 
of possible explanations are suggested, one of them being the efficiency 
of the personnel counseling system at Minnesota. 

Jones (123) maintains that the higher the correlation between 
university grades ‘and intelligence test scores the greater must be the 
scholastic motivation of the students. He points out that at Buffalo 
the correlation has increased with the institution of a number of 
administrative measures designed to encourage and reward good 
work. Along this line it was found (31) that the correlation between 
Alpha and marks on an expected final examination was .52, while 
with quizzes sprung without notice the r was only .06. And another 
study (298) leads to the conclusion that drive is the most important 
factor in the success of students. 


CoNcLUDING NOTE 


The writer has been reluctantly forced to the conclusion that his 
previous summary has not been widely read or seriously pondered 
upon by the fact that he finds so many investigations since that date 
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still suffering from the faults he then castigated (see 104, p. 15), 
namely: (1) heterogeneity of subjects; (2) failure to hold intelli- 
gence constant ; and (3) failure to present results in terms of statisti- 


cal reliability. 

It is the writer’s personal opinion, unsupported by anything 
resembling a regression equation and based merely on his own find- 
ings, total immersion in the results and opinions of other investi- 
gators, and some cogitation, that the essential factors in student 
achievement are, in the order of their importance: 

(1) Ability (or intelligence, or scholastic aptitude, etc.). 

(2) Effort (or drive, or degree of motivation, etc. ). 

(3) Circumstances (personal, social, economic, academic, etc.). 
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“CONSTANCY ” OF THE IQ 


BY ROBERT L. THORNDIKE 
Teachers College, Columbia University 


It is the purpose of this article to review recent experimental 
evidence on the stability of intelligence test performance by the same 
individual over a lapse of time. The content of the review will ‘be 
limited in two ways: 

(1) It is proposed to consider chiefly those reports which have 
appeared since 1930 or 1931. The literature previous to that time 
has been covered by Nemzek (75), Foran (24, 25), and others. 
Earlier studies will be referred to only as this becomes necessary to 
understand recent trends of investigation. 

(2) It is proposed to limit this report primarily to those investi- 
gations in which the same individuals have been tested more than 
once. Although studies of the resemblance of relatives, the intelli- 
gence level of occupational groups, etc. provide evidence concerning 
the relative weight of hereditary, as opposed to environmental, factors 
in producing differences in intelligence test performance, and conse- 
quently bear on the stability of the performance of the individual, it 
is not our purpose to try to cover this broader literature here. 

Certain general trends seem to appear in the type of experimenta- 
tion reported within the last eight years, and it might be profitable 
to consider these briefly before going on to a survey of findings. In 
the period now being studied, the general fact of a rather high posi- 
tive correlation between test and retest after an interval of time in 
children of school age seems to be fairly well taken for granted, and 
telatively few studies were found which were concerned merely with 
demonstrating this point. Most studies try to push this finding 
forward in one or another of several directions, of which the following 
are rather typical: (a) attempts to get retest information on younger 
and younger children; (b) efforts to get retest data over longer 
periods of time ; (c) efforts to determine the effect upon test constancy 
of particular types of environmental manipulation. 


GENERAL StupiEs oF Test-Retest CORRELATION IN ADULTS 


Several studies of college students at the beginning and end of 
their college course have been carried out, primarily with the purpose 
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of studying gains during the years at college. These studies haye 
agreed in finding that those students who remain through a college 
course tend to gain in intelligence test score. Test-retest correlations 
are also reported. Between scores on the Thorndike Intelligence 
Examination for High School Graduates, Part I, before and after a 
two-year course at normal school Masters and Upshall (63) report 
correlations of .78 and .80 for two groups. Wolcott (111) finds a 
correlation of .81 between scores as freshmen and as seniors on the 
Thorndike examination. McConnell (66) reports a correlation of 
.83 between freshman and senior scores on the American Coungl 
examination. For this same test, given before and after a four-year 
interval of college, Livesay (55) reports a correlation of .88 for total 
score and correlations of .69 to .82 for individual subtests. 

Testing reformatory inmates with an average age of about 20 
years on the Kuhlmann-Anderson test, Hales (32) reports a corre- 
lation of .89 when the average mterval between test and retest js 
about five years. 

Two studies have been made of small groups of adults over a 
10-year interval. Davidson (17) reports a correlation of .89 fora 
group of 50 insurance company employees tested with a brief group 
intelligence test. Garrison (28) retested a group of students, whose 
average age at time of first testing was 25 years, with the Yerkes 
point scale, using a revised scoring. The correlations were .58 for 
a group of 32 men and .76 for a group of 41 women. 

In view of the limitations placed by the reliability of the tests 
used and the limited range to be expected in some of the groups, 
these studies offer confirming evidence of the stability of intelligence 
test performance in adults in fairly stable surroundings, even over 
quite a period of years. 


StupDiEs oF ScHooLt-AGE CHILDREN 


In the decade of the twenties, reports of correlations and mean 
differences between intelligence tests and retests given after various 
lapses of time were quite numerous for general school-age popu- 
lations. In the thirties, however, relatively few such studies have 
appeared. Studies of too recent a date to be included in Nemzek’s 
review (75) are reported below. 

Lauderbach and Hause (47). McCall multi-mental test given to 
150 students in Grades 4 to 6. Retested after an interval of Il 
months. Test-retest correlation of .791. 

Lincoln and Wadleigh (53). Otis Primary, National A, and 
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Terman A given in successive years at approximately one-year inter- 
yals. The 154 children tested were in Grade 3 at the time of the 
first test. The median of all the differences in 1Q was reported as 
7.29 points. Thirty-seven per cent of the changes were 10 points or 
more. 

Miller (69). The data reported by Hirsch (37) were reanalyzed, 
using only those cases who took all tests and converting the scores on 
successive tests into scores with the same mean and standard devi- 
ation. The median difference was found to increase progressively 
from 5.4 IQ points at a one-year interval to 7.3 points at a five-year 
interval. The corresponding correlation dropped from approximately 
85 to .79. 

Lithauer and Klineberg (54). The test-retest correlation for the 
Stanford-Binet for 120 orphans first tested shortly after entrance to 
the Hebrew Orphan Asylum, and retested after a median interval 
of 14 months, is reported to be .76. 

Traxler (102). Otis Self-Administering IQ’s were obtained for 
students at the University of Chicago High School. Retests were 
obtained after intervals of one, two, and three years. Correlations 
were .756, .673, and .676, respectively. For the same students, the 
retest correlation for the Binet with an interval of one to four years 
is reported to be .762. 

Seagoe (87). Tests were given to school children in Grades 1, 
3, 5, and 7. Tests were Detroit First-Grade, Detroit Primary, 
National Intelligence, and Terman Group, respectively. Retests were 
available for those children who remained in the system long enough 
to be caught by two or more tests. Successive tests were at two-year 
intervals. Correlations were reported as follows: Detroit First- 
Grade vs. Detroit Primary, .642; Detroit Primary vs. N.I.T., .700 
and .727; N.L.T. vs. Terman Group, .797 and .869. Over a four- 
year interval, Detroit Primary correlated with Terman Group .765. 

Lorge (57). About 160 boys, a random sample of 860 who had 
been tested in the eighth grade, were retested after an interval of a 
little over 10 years. Both raw correlations and correlations corrected 
for estimated test unreliability were presented. These were: 


Raw Corrected 
Thorndike-McCall Reading Scale..... i Be 57 74 
I.E.R. Arithmetic Test................ 5% 60 79 
Stenquist Assembly Test......... eh .66 1.06 
I.E.R. General Clerical Test.......... ara 63 75 


The results of these studies fit in quite well with the general trend 
of earlier investigations, and the size of the correlations reported here 
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is comparable to the average for the Stanford-Binet and for group 
tests reported in Nemzek’s summary (75). 

The somewhat lower correlations reported in the study by 
Lorge (57) direct attention to the factor of interval between test and: 
retest as one factor influencing the stability of performance from teg 
to retest. his has been considered by R. L. Thorndike (101) ing 
synthesis of previous studies. Putting together the data from thoge 
studies in which the interval between test and retest was fairly 
uniform, a curve was fitted to all the data by the method of least 
squares. The correlation for an immediate retest, estimated from 
this curve, falls close to .90, and the correlation for a test after a 
five-year interval is estimated to be about .70. Further data on the 
question of time interval is contributed by the study of R. R, 
Brown (8). In one comparison of 58 cases who had a first retest 
after an interval of less than two years and a second retest after an 
interval of five years or more, he found the correlation between initial 
test and first retesf to be .86 and between initial test and second 
retest to be .61. “It seems apparent that one factor which influences 
the accuracy of prediction from an intelligence test at school age is 
the length of time over which we endeavor to predict. 


Stupies oF GIFTED CHILDREN First TESTED AT ScHOOL AGE 


The performance upon adult tests at maturity of gifted children 
first tested while in school has been studied by Hollingworth and 
Kaunitz (39) and by Lorge and Hollingworth (58). In the first of 
these studies, 116 children who had received Binet IQ’s of 135 or 
higher when they were about eight years old were tested with the 
Army Alpha approximately 10 years later. Of these children, who 
had been chosen to represent the top centile of the child population, 
82% scored in the top centile on the Army Alpha according to the 
Army norms. The remainder were in the high centiles. No indi- 
vidual regressed to, or nearly to, average. Lorge and Hollingworth 
report additional test material on a group of 21 individuals who, as 
children, had a median IQ of 168. On the CAVD intelligence test, 
20 of the 21 were found to surpass the median of law school fresh- 
men. Those who had, as children, received IQ’s of about 140 were 
found to define approximately the seventy-fifth percentile of college 
graduates, taking it the country over. 

Whether the IQ’s of gifted children tend to increase or decrease 
has been studied by Lincoln (49, 50, 51, 52) and by Cattell (10). 
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Lincoln finds that, when children are classed on the basis of their 
initial test; those with high initial tests tend to decline. Cattell finds 
that, when children are classified on the basis of the average of their 
jnitial and final tests, children in the highest classifications tend to 
have higher final tests than initial tests. These results are in no sense 
contradictory, the first illustrating the universal tendency of, any 
fallible measure to regress toward the mean of the sample” upon: 
repetition, and the second being evidence of a tendency for the varia- 
bility of IQ’s in the population to increase with increasing age. 
Nemzek (73, 74) analyzed data previously reported by Carrolh 
and Hollingworth and by Lamson, and found correlations between 
Stanford-Binet test and retest with an interval of one or two years 
of 53 to .72. For data on the Herring-Binet the correlation was .73. 
For the Stanford-Binet the mean change ranged from 7.85 to 9.37 
points, and for the Herring-Binet it was 9.06 points. Nemzek con- 
cluded that the IO was more variable for gifted children, which is in 
accord with other results on this question. McNemar (67) has 
pointed out that the greater variability of high IQ’s is inherent in 
the use of a ratio and appears as an inevitable consequence if the 
variability in performance at a given mental age level is constant. 


RETARDED OR OTHERWISE HANDICAPPED GROUPS 


Studies of the mentally retarded by Arthur (2), Engel (21),,. 
Hoakley (38), Parker (77,78), and Woodall (112) confirm, in gen- 
eral, previous findings of a progressive decline in the IQ’s of these 
groups. Woodall finds an increase in IQ for retarded individuals 
after the age of 16, indicating that mental growth has not completely 
stopped at that age. The greater stability of the IQ for defective 
groups again becomes apparent in these studies. As mentioned 
above, it must be remembered that this is implicit in the use of such 
a ratio as the IQ. 

Schott (84) reports that adult neuropsychiatric cases show 
markedly greater variation between tests than do normal children. 

Miller (68) suggests that a pseudointellectual deficiency may be 
produced by emotional maladjustment, and that this is remediable 
with careful treatment. 

Arthur (3) finds that “an examination with the Kuhlmann-Binet 
given by an experienced psychologist to a kindergarten, first, or 
second grade child from a non-English-speaking home can yield a 
rating with a high degree of reliability and predictive value as meas- 
ured by achievement on the Kuhlmann-Anderson 5 to 7 years later, 
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if the child has had as much as a year in the English-speaking schoo] 
environment.” 

Gildea and Macoubrey (30) and O’Neill (76) have endeavored 
to analyze the factors associated with large changes in IQ between 
test and retest. Lack of a control group in one study, and limited 
statistical analysis in both, preclude any definite conclusions. The 
pattern of factors involved seems to be quite complex in any event, 


PREDICTIVE VALUE OF INFANT AND PRESCHOOL TESTS 


Following upon the success of the intelligence tests for school-age 
children, efforts have been made to develop tests for younger and 
younger children. The years covered by this review have witnessed 
a growing crop of studies of the constancy and predictive value of 
indices derived from these early tests. Evidence on the predictive 
value of tests at these early years has been presented by Bayley (6), 
Cunningham (15), Driscoll (19), Furfey and Muehlenbein (27), 
Gesell and others (29), Hallowell (33), Herring (35), Honzik (40), 
Hubbard (41), Kawin (43), Mowrer (70), Nelson and Richards (71, 
72), Stutsman (98), Symmes (99), Updegraff (103), and Well- 
man (105, 106, 107). These studies cover a number of different tests 
and differ quite a bit in the age and other characteristics of the 
population studied. Consequently, any effort to lump the studies 
together and generalize from them may well be questioned. How- 
ever, we have tried to do this in a rough sort of way. Taking the 
test-retest correlations from those studies in which they were avail- 
able, and in which the tests had been given at fairly definitely specified 
ages, it was possible to tabulate them according to age at first test 
and interval between test and retest. This has been done with 
correlations from the studies by Bayley (6), Cunningham (15), 
Driscoll (19), Furfey and Muehlenbein (27), Herring (35), 
Honzik (40), Kawin (43), and Nelson and Richards (71, 72), and 
from the earlier study by Goodenough (31). A table was prepared, 
and each correlation coefficient was entered in the appropriate cell. 
The average correlation coefficient in each cell was then determined, 
as a rough estimate of the degree to which it is possible to predict 
over that interval at that age. These average coefficients are pre- 


sented in Table I. 
In any interpretation of Table I the diversity of tests studied, 
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the diversity of final tests being predicted, and variations in the 
range of ability in the groups being studied must be borne in mind. 
Furthermore, no effort was made to apply differential weights, taking 
into account the different sizes of populations studied, to the cor- 
relations upon which Table I is based. Finally, some of the figures 
reported in Table I are based upon only one correlation or the work 
of only one experimenter, while others are based upon a number. 
However, it seems clear from these results that the adequacy of 
prediction of future mental development is a combined function of 
the age at which the test is given and of the length of time over which 


TABLE I 


Composite oF Test-Retrest CorreLatTions From Severat Srupies or INFANT 
AND PRESCHOOL GROUPS 


Interval Between Test and Retest 





Age at Lessthan 49 10-15 16-21 22-29 3-41 42-53 Over 
Earlier Test 4mos. mos. mos. mos. mos mos. mos. 53 mos. 


Under 4 mos. F 33 10. —& ~.09 

49 mos. y 53 .49 ee 16 4 00 

10-15 mos. ‘ 66 50 AS 33 

16-21 mos. j 68 51 38 25 
mos. d 74 68 
mos. ’ 68 .66 57 ; 56 
mos. ; 65 72 66 3 63 
mos. 76 73 


we try to predict. Before the age of four months, it seems impossible 
to get satisfactory correlations between tests given no more than a 
month apart. From this point on, however, correlations of .75 and 
better seem to be obtainable for such short-time forecasting. But 
it is not until the individual is about four years old that predictions - 
can be made over a span of as long as a year with a correlation of 
/0 or better. It does not seem possible to make very reliable long- 
time predictions from any of these early tests. 

Honzik (40), working with tests between the ages of 21 and 84 
months, suggested that the size of correlation might be thought of 
as a linear function of age at the time of the first test divided by age 
at the time of the second test. She found a correlation of .92 for one 
group and a correlation of .78 for another between this ratio and the 
correlation coefficient between test and retest. Apparently, within 
the range of ages and intervals that she studied, the relation did hold 
in an approximate sort of way. Whether it would hold as well over 
a greater range of ages is, of course, open to question. English and 
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Killian (22) present some evidence to suggest that age is not a factor 
in determining the amount of variation between tests between the 
ages of 4 and 16. 

At least two alternative explanations of the limited value of these 
early tests in predicting school-age intelligence test performance may 
be suggested. On the one hand, it may be argued that at this early 
age the individual is very susceptible to environmental influences, 
and so his intellectual development is likely to be markedly accelerated 
or retarded, depending upon the environment to which he is exposed. 
Low correlations between early and late tests would be thought of 
as due to the differential effect of different environments. On the 
other hand, it may be suggested that the aspects of mental functioning 
studied by early tests are rather different from those incorporated in 
school-age tests. This seems at least superficially true, as we con- 
sider the test materials. It may be that in the early tests we are not 
testing quite the same type or aspect of “ intelligence” that we test 
in school children, possibly because the type of function which we 
study in our later tests is not as yet present or susceptible to testing, 
The low correlations would, in this case, be explained as due to a 
considerable shift in the aspect of intellectual functioning which we 
were testing. 


Errect Upon INTELLIGENCE OF CHANGE IN PHYSICAL 
ENVIRONMENT 


Studies of the effect of diet upon learning and intelligence are 
reviewed by Fritz (26), who concludes that “ contrary to what would 
be generally supposed, there is very little experimental evidence at 
-present to indicate that diet markedly affects intelligence or capacity 
to learn.” Fritz indicates that the results of Maurer and Tsai, show- 
ing the effect of vitamin B deficiency upon maze learning in rats, 
represent an exception to this generalization. 

Balken and Maurer (5) report a preliminary study of the effect 
of increasing the vitamin B intake of malnourished children. A 
variety of mental tests were given to 46 children from homes of very 
low economic status in which vitamin B deficiency was probable. The 
children were given supplementary feedings of vitamin B for 15 
weeks, and tests were repeated at the end of this time. Gains were 
found upon each test, but these were generally small.- No control 
group was used, so it is impossible to determine whether there would 
have been any residual gain after practice effects had been eliminated. 
Poull (80) matched 41 pairs of records from the testing at New 
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York City Children’s Hospital. One member of each pair was orig- 
jnally malnourished, as judged by the physician’s statement, and 
subsequently became well nourished. The control group was uni- 
formly well nourished. The group whose nutritional status improved 
showed an average gain in IO of approximately 10 points, while the 
control group showed no change. 

Studies of the effect of diseased tonsils and adenoids by 
Richey (81) and of intestinal protozoa by Schell (83) gave negative 
results. 

Mateer (64) finds evidence that gains may result from a com- 
bination of glandular and educational treatment in hypopituitary cases. 

Dawson (18) reports small, but reliable, gains in IQ for a group 
removed from a slum to a housing project. 


EFFECT OF CHANGED SOCIAL OR EDUCATIONAL ENVIRONMENT 

We must next consider various investigations which introduce 
certain special factors into the social or educational life of the child. 
Familiar patterns, occurring during the period covered by this 
survey, are (1) placement in a foster home, (2) placement in some 
sort of institution, (3) experience in a nursery school, (4) exposure 
to some special type of schooling in the regular grades, either in the 
general character of the school or through some special type of 
instruction. } 

To the earlier foster-child studies there have been added during 
this period studies by Hinton (36), Leahy (48), Lithauer and 
Klineberg (54), Schott (85), Skodak (95,96) [the report by Skodak 
is taken as most completely representing the lowa investigation, parts 
of which have also been reported by Skeels (90, 91, 92)], and Wells 
and Arthur (109). The last four of these authors report average 
changes from before placement in a foster home to some time after 
placement. The studies by Lithauer and Klineberg, Schott, and 
Skodak agree in finding an average rise of about six points in IQ 
for retests given within a year or two of placement. In the Skodak 
study there was some indication of further gain upon a subsequent 
retest. 

The study by Skodak centered its attention upon the intelligence 
of a group of children (mostly known to be illegitimate) who were 
placed in foster homes before the age of six months. Each of these 
children received two tests, the first at an average age of two years, 
seven months, and the second at an average age of four years, four 
months. The average IQ was 116.0 at the time of the first test and 
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111.5 at the time of the second test. The correlation between these 
two tests, with an average interval of 21 months, is reported as .56, 
This corresponds quite closely to correlations reported in other 
studies for this age and interval, as summarized in our Table I. 

The average level of 1Q at the time of the second test is in close 
agreement with results reported by Leahy (48) for somewhat older 
foster illegitimate children. Leahy reports a mean IQ of 110. What 
judgment we shall make as to the contribution of the improved foster 
environment to this average IQ depends upon a judgment as to the 
IQ expectancy from the true parents. For some of her cases Skodak 
presents data on true-father’s occupation, true-parent’s education, and 
true-mother’s Binet IQ. Occupationally, the group is clearly low, 
though just what that means for intelligence in these times is not as 
clear as it would have been a decade or more ago. The mean mid-parent 
education is 10 grades, which seems fairly substantial. The mean 
IQ of mother is reported as 88.7.. This, however, is derived from 
the Stanford-Binet, with 16 years as the divisor, and is established 
for a group of unspecified age, tested at an unspecified interval after 
their school experience. An assumption that the genetic expectancy 
of the foster group was below 100 IO seems to be open to ques- 
tion—especially as the preplacement I1Q’s of the group of children 
tested before and after placement (mentioned earlier), who repre- 
sented a poorer sample educationally and occupationally, had an 
average of 98.5. 

The relationship of foster-child intelligence to characteristics of 
the true and foster parents is somewhat vitiated in this study by 
selective placement. There was a correlation of .30 between mid- 
parent education for true and foster parent. This indication of selec- 
tion makes any interpretation of relationship of child intelligence to 
either characteristics of true parents or characteristics of foster 
parents somewhat hazardous. 

Wells and Arthur (109) studied children one or both of whose 
parents were classed as feebleminded. They compared the course of 
intellectual development for children left in their own homes with that 
of children placed in certified foster homes. Over a period of about 
five years the differential between these two groups amounted to 
about eight IQ points in favor of those in foster homes. For a rather 
small group, first tested before the age of five years, the differential 
was approximately 16 points. 

Rather in contrast with the other findings, Hinton (36) reports 
that children transferred to Mooseheart, which is judged to presenta 
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social environment greatly superior to the one these children had 
previously known, show no reliable change in IQ over a period of 
five years subsequent to entrance. However, the younger children 
are reported to gain more than the older ones. 

Change in IQ during residence in an orphanage or during 
residence in an institution for the feebleminded was studied by 
Crissey (13, 14). The finding which is emphasized in this study, as 
in a number of the other studies from the University of Iowa, is that 
the extreme members of the group tend to move toward the average 
of the group upon repeated tests. There does not seem to be sufficient 
appreciation, however, of the significance of errors of measurement 
in producing this change. It must be borne in mind, as one reads 
any of these studies, that those membefs of any group who are at 
the top of the group in score on a fallible measure, whether the scores 
be high or low in terms of some more extensive total population, 
fall at the top of the group, in part because of a preponderance of plus 
errors of measurement, and will tend, in the absence of other influ- 
ences, to drop back toward the mean of the group in any test that 
is not perfectly correlated with the first test. The converse is, of 
course, true of those at the bottom. 

Crissey also finds a suggestion that those who would be expected 
to have constant IQ’s in an orphanage tend to show a drop when 
transferred to the home for feebleminded, and that those who would 
be expected to drop in the home for feebleminded fail to do so when 
transferred to the orphanage. This is interpreted as being due to the 
difference in stimulation value of the two environments. One must 
bear in mind, however, the possibility that other factors, which were 
a symptom of a dropping IQ, caused the transfer to the asylum, or 
that other behavior, indicating intelligence superior to that demon- 
strated on the test, may have been a factor in transferring children 
from the asylum to the orphanage. 

A number of studies have brought out the retarding effect of an 
impoverished environment by comparing IQ’s of younger and older 
children in the same family or district. During the period of this 
review, studies based upon such data were reported by Asher (4), 
Ludeman and McAnelly (60), Sherman and Key (88), Skeels and 
Fillmore (93), and Wheeler (110). Since they fall somewhat out- 
side the focus of interest of this review, no further mention will be 
made of them. 


2. 


Basa 


e 


To the earlier studies of the effect of nursery school training, the 
period here being reviewed sees the addition of the series of reports 


7 
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by Wellman and others (12, 105, 106, 107; 108) of children in the 
University of Iowa preschool, the study by Skeels, Updegraff 
Wellman, and Williams (94) of a nursery school project in an Iowa 
orphanage, and the report by Kawin and Hoefer (44) of a nursery 
school project near Chicago. 

The findings from the Iowa studies may be summarized somewhat 
as follows: 

(1) The Binet IQ of children from generally superior homes 
rose markedly during a period in nursery school, but did not rise 
during the summer spent in the general home environment (105), 

(2) Performance on the Merrill-Palmer test showed some residual 
gain, after nursery school attendance, over and above apparent 
adaptation effects, but the influence was not as marked as for the 
Binet (108). 

(3) The gains in Binet IQ were maintained by a sample of chil- 
dren located and tested after several years of attendance at other than 
University schools (106). 

(4) The gains in Binet IQ were further added to by a sample 
of children who remained in the University schools and were tested 
at a later time (106). 

(5) Length of attendance at the University schools was related 
to intelligence test score in high school and at college entrance (107), 

(6) Gain from attending preschool was not related to the occu 
pational level of the parent (12). 

(7) The greatest gain in preschool was for those who originally 
received the lowest scores, and the smallest gain, for those who 
received the highest (105). 

(8) In an orphanage preschool, attendance of 200 days or more 
resulted in some gain in Binet IO, whereas a control group exposed 
to the general orphanage environment for this same length of time 
showed some loss in IQ (94). 

The study by Kawin and Hoefer (44) failed to confirm the gains 
found in the Iowa studies. It was found that the gains in Merrill- 
Palmer score made by the nursery school group and the paired con- 
trol group were substantially identical. These gains were attributed 
to practice. 

There seems to be little question as to the genuineness of the 
immediate gains in Binet IQ in the Iowa preschool studies. How- 
ever, before we accept the conclusion that gains so produced are 
permanent and lasting, it might be well to consider certain other 
explanations of their apparent permanence. The permanence of the 
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gains is inferred from conclusions (3) and (4) above. Before the 
lasting rise is definitely assigned to the nursery school experience, 
there are three other possible factors which should be considered: 

(1) There is a possibility of selection. If we admit that there is 
a considerable element of chance in the score which a young child 
will make upon an intelligence test, we must agree that those children 
who test at 100 IQ, for example, cover quite a range in underlying 
“true” ability. Some have been overestimated in our testing and 
some underestimated. If there is a tendency for those who have been 
overestimated—whose “ true”’ IO is below 100 and who will conse- 
quently tend to drop on later tests—to be eliminated from the popu- 
lation which is retested, we may expect the fraction of the original 
population to whom we give retests to gain in IQ. 

(2) Though these children did not gain between sessions of 
nursery schools, there is still the possibility that the sudden gain 
during nursery school represents a short-circuiting of a gain that 
would have appeared gradually over the years as a result of a gen- 
erally superior home environment. That is, if the people who send 
their children to nursery school are people who will provide generally 
good and stimulating homes for their children, the sudden gains 
which appeared during nursery school and were subsequently main- 
tained might have appeared as a slow cumulative development in the 
home environment. This would assume a temporary nursery school 
effect, sudden and gradually wearing off, balanced by a more per- 
manent home effect, gradually making itself manifest. 

(3) It may be that children from the genetic background repre- 
sented by the University nursery school groups tend to do better on 
the type of intelligence tested at the older ages than on the type tested 
in the first few years. In this case, then, the permanent change would 
be explained as due to a change in the character of the test with the 
older children to a test upon which they were, by genetic constitution, 
more extreme deviates than they were upon the early tests. 

It must be admitted that we do not have much evidence to back 
up any of these suggestions. However, it does not seem that they 
are excluded as possibilities. : 

Peterson’s study (79), which compares a small group of nursery 
school children with another small group of non-nursery-school chil- 
dren as they enter and go through kindergarten, may be cited as 
suggesting that the intellectual increment for nursery school children 
at kindergarten age is not great. Only a small and diminishing 
superiority was found for the nursery school group when they were 
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compared with non-nursery-school children of comparable socio- 
economic status. 

A rather vitriolic critique of Wellman’s conclusions has been pub- 
lished by Simpson (89). Specific criticisms are directed primarily 
at the first of Wellman’s publications (104). 

Lamson (46) endeavors to determine to what extent intelligence 
quotients are increased by children who participate in what ig 
described as ‘“‘a vital curriculum—one that considers children’s 
interests and capacities, that requires activity and self-direction on the 
part of children, and that makes progress at the child’s own optimum 
rate possible.” No evidence of gain is found in this school. 

Studies by Hawthorne (34), by Lowry (59), and by Scruggs (86) 
investigate the effect of special training in reading upon intelligence 
test performance. Hawthorne found that pupils from 5 to 12 years 
of age, who were of average intelligence, but retarded in reading, and 
who had improved in reading during remedial instruction at twice 
the normal rate, showed no corresponding improvement on a group 
intelligence test. Lowry, on the other hand, found that 50 children 
who were given three months of intensive reading drill, which pro- 
duced an average gain of 1.36 grades in five reading tests and 2.72 
grades in two speed-of-reading tests, showed a gain from a pre-test 
on Form A to a post-test on Form B of the Otis Intermediate Exami- 
nation of 11.76 points in IQ. There was no control group to take 
care of practice effects or possible inequality of standardization of the 
two tests. 

Scruggs (86) also finds gains in intelligence from special reading 
instruction. A group of fifth-grade negro children who “ followed 
an especially prepared course in reading involving intensive and 
extensive work . . . exercises in vocabulary development, following 
directions, fact getting, recognition of central thought, organization 
and summarization, rate and speed of reading and in manipulation” 
gained more than a control group on a variety of verbal and non- 
verbal intelligence tests. There is reported to be some residual gain 
a year and two years later. 

Durrell (20) finds that children whose reading ability is better 
than would be expected from their Binet IQ do better on paper-and- 
pencil intelligence tests than on the Binet. 


CoNDITIONS OF TESTING 


Finally, we shall make brief mention of a number of studies which 
indicate the effect of certain conditions of testing upon the constancy 
of the resulting 1Q’s. 
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Madsen (61) and Jordan (42) report on the reliability of Binet 
tests given by student examiners, reporting test-retest correlations 
ranging from .65 to .84. 

Mayer (65) and Rust (82) study the effect of negativism upon 
the scores of young children. 

Adkins (1), Dave (16), Lammermann (45), and Snedden (97) 
report evidence of practice effects where the same test is repeated 
or another test is given after a brief interval. 

Benton (7), Ferguson (23), and Maller and Zubin (62) report 
studies in which special incentives were introduced into the testing 
situation. In no case did the special incentives result in significant 
increases in score, as compared to a control group 

Cattell (11) gives evidence of the variation in standard between 
different trained examiners. 

Canady (9) reports a small-scale investigation of the effect of race 
of examiner upon test performance, comparing the performances of 
white and negro children tested by white and negro examiners. 

Lodge (56) reports evidence of seasonal fluctuation in IQ. Analyz- 
ing results for a group of children living in their own home environ- 
ment, tested at six-month intervals, he finds that tests given between 
November 1 and April 30 tend to be higher than those given during 
the other half of the year. It is suggested that some of the apparent 
preschool effect may be due to such a seasonal variation. 
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NOTES AND NEWS 


Dr. Leta S. HoLt_incwortH, a member of the faculty of Teachers 
College, Columbia University, since 1916 and wife of Dr. H. L. 
Hollingworth, professor of psychology at Barnard College, Columbia 
University, died on November 27, at the age of 53, in the Columbia- 
Presbyterian Medical Center in New York City. 


Tue Midwestern Psychological Association will hold its annual meet- 
ings on May 3 and 4 with the department of psychology at the University 
of Chicago. Professor F. C. Kingsbury is acting as local chairman of the 
meetings. 

The presidential address will be given by Professor J. P. Guilford, of 
the University of Nebraska. In addition, a full program of research 
papers and symposia is being scheduled by the Program Committee under 
Professor Fred McKinney, of the University of Missouri. Psychologists 
and others in related fields are cordially invited to attend. 


Tue department of child study at Vassar College has received a two- 
year grant from the General Education Board and a supplementary fund 
from the Josiah Macy, Jr., Foundation for research in the development 
of personality in young children. This work is under the direction of 
Dr. Mary Shattuck Fisher, chairman of the department, who is being 
assisted by Dr. L. Joseph Stone, research editor, and other members of 
the department. Plans for certain work in codperation with Dr. Lyle H. 
Lanier, of the psychology department, are under discussion. There will 
be special emphasis on pictorial methods of presentation, and sound films 
summarizing case studies of normal children will be prepared for use in 
courses in child psychology and education. These will be made available 
through Professor Daniel Prescott, of the department of education at the 
University of Chicago, and the American Council on Education. 


Tue Committee on Scientific Aids to Learning, under the chairman- 
ship of President Conant of Harvard, has made a grant to cover the cost 
of preparing a microfilm master negative, on the most expensive film, of 
sets of volumes of scientific and learned journals. This permits the 
nonprofit Bibliofilm Service to supply microfilm copies at the sole positive 
copy cost of one cent per page for odd volumes, or at a special rate of 
one-half cent per page for any 10 or more consecutive volumes, if properly 
copyable. The number of pages will be estimated on request to the 
American Documentation Institute, care of offices of Science Service, 
2101 Constitution Avenue, Washington, D. C. 


THE response to the announcement of the forthcoming publication of 
the Abstract References of the Psychological Index by the American 
Psychological Association has been encouraging. The Abstract Refer- 
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ences in conjunction with the Psychological Index serve as an exte 
of the Psychological Abstracts back to 1894 by giving references 
abstracts, published in various periodicals, of 43% of the entire literatype 
listed in the Psychological Index from 1928 back to 1894. 3 
Since some institutions possessing the Psychological Index have tm 
yet ordered the Abstract References, it is suggested that psychologigis 
who will be likely to avail themselves of this research-aid inquire of ther 
libraries whether a copy has been ordered. The price is nominal: 
for the first part, containing over one-third of the material, and pro 
$3.00 for the second part, containing the rest. Address orders to Abstragt 
References, 25 Claremont Avenue, New York City. 











